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Abstract —Distributed desynchronization algorithms are key to 
wireless sensor networks as they allow for medinm access control 
in a decentralized manner. In this paper, we view desynchro¬ 
nization primitives as iterative methods that solve optimization 
problems. In particular, by formalizing a well established desyn¬ 
chronization algorithm as a gradient descent method, we establish 
novel npper bonnds on the number of iterations required to reach 
convergence. Moreover, by nsing Nesterov’s accelerated gradient 
method, we propose a novel desynchronization primitive that 
provides for faster convergence to the steady state. Importantly, 
we propose a novel algorithm that leads to decentralized time- 
synchronous multichannel TDMA coordination by formulating 
this task as an optimization problem. Onr simulations and 
experiments on a densely-connected IEEE 802.15.4-based wireless 
sensor network demonstrate that our scheme provides for faster 
convergence to the steady state, robnstness to hidden nodes, 
higher network thronghput and comparable power dissipation 
with respect to the recently standardized IEEE 802.I5.4e-20I2 
time-synchronized channel hopping (TSCH) scheme. 

Index Terms —Medinm access control, desynchronization, gra¬ 
dient methods, decentralized multichannel coordination. 

I. Introduction 

N WIRELESS sensor networks (WSNs), achieving and 
maintaining (de)synchronization among the nodes supports 
various functionalities, including data aggregation, duty cy¬ 
cling, and cooperative communications. In particular, devising 
protocols that perform desynchronization at the medium access 
control (MAC) layer is key in achieving fair TDMA scheduling 
among the nodes in a channel l^-lTTl. 

In order to extend fair TDMA scheduling to large-scale 
networks, protocols that achieve (de)synchronization across 
multiple channels a, Q are required. Typical approaches are 
infrastructure-based (i.e., centralized), as they use a coordi¬ 
nation channel and/or node and a global clock (e.g., via a 
GPS system) ||5|. Channel hopping is been accepted as a good 
solution for MAC-layer coordination for dense WSN topolo¬ 
gies. According to channel hopping, nodes hop between the 
available channels of the physical layer such that they are not 
constantly using a channel with excessive interference. Eorm- 
ing the state-of-the-art, the time-synchronized channel hopping 
(TSCH) in protocol is now part of the IEEE 802.15.4e- 
2012 standard fSl. In TSCH, each node reserves timeslots 
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within the predefined slotframe interval and within the 16 
channels of IEEE 802.15.4. However, filling up the avail¬ 
able slots follows an advertising request-and-acknowledgment 
(RQ/ACK) process on a coordination channel. This channel is 
prone to interference and self-inflicted collisions when nodes 
advertise slots aggressively. Moreover, when nodes leave the 
network, their slots may remain unoccupied for long periods 
until another advertisement process reassigns them to other 
nodes. This limits the bandwidth usage per channel and does 
not allow for fast convergence to the steady stat^H It is also 
important to note that TSCH requires a coordinator to maintain 
global time synchronization a, Q. 

To achieve infrastructure-less (i.e., decentralized) WSN 
MAC-layer coordination, distributed (de)synchro-nization al¬ 
gorithms have attracted a lot of interest 0, a, ||9l- lfT7l . 
These algorithms are inspired by biological agents modeled 
as pulse-coupled oscillators (PCOs) 0, m, d, namely, as 
timing mechanisms following a periodic pulsing (i.e., beacon 
packet transmission at the MAC) that is updated via the 
timings of pulses heard from other nodes. 

Most work on distributed (de)synchronization is based on 
the PCO dynamics model introduced by Mirollo and Strogatz 
d, and derives several algorithms with properties of practical 
relevance to WSN deployments, namely; (i) limited listening 
a, d, d, a property that is imperative for low energy 
consumption in wireless transceivers; (ii) solutions amenable 
to multi-hop network topologies and the existence of hidden 
nodes 0, d, d; (Hi) solutions scalable to large groups 
of nodes a, d; and (iv) modihcations that lead to fast 
convergence to steady state d-d, d- PCO-based syn¬ 
chronization methods have also been interpreted as consensus 
algorithms for multi-agent systems ll^ - ll2^ . The work in 
studied synchronization of networked oscillators under 
heterogeneous time-delays and varying topologies. In ll24ll . the 
synchronization of networked oscillators was modeled using 
coupled discrete-time phase locked loops. 

Regarding the study of the convergence speed of desynchro¬ 
nization algorithms, mostly estimates based on simulations 
or empirical measurements have been derived. In effect, only 
lower bounds d, d, order-of-convergence estimates a, 
0, d and operational estimates ll25l have been established. 
However, no upper bounds are currently known for the conver¬ 
gence speed of desynchronization algorithms, despite the fact 
that such bounds provide for worst-case guarantees of time and 

^Both high network throughput and quick convergence are important for 
WSNs that operate with a periodic wake-up cycle (or are event-triggered) and 
must quickly converge to a steady operational state and transmit high data 
volumes before being re-suspended. 
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Fig. 1. Phase update of node i according to the Desync algorithm: node i — 
1 fires at time ti—i, and node i updates its phase from 6i{ti—\) to 
towards the average of the phases of nodes i — 1 and i+l, its phase neighbors. 


energy consumption to achieve the state of desynchrony. Fur¬ 
thermore, despite the plethora of works on PCOs, the problem 
of extending distributed (de)synchronization algorithms to the 
multichannel case (which is key in today’s wireless networks) 
has received limited attention. A preliminary attempt was done 
in 0, where desynchronization was independently applied per 
channel. The limitation of the scheme in Q is that, since the 
nodes in different channels are not synchronized, when a node 
switches channels convergence needs to be established anew. 

In this work, we view the problem of desynchronization 
as an optimization problem. In particular, we show that a 
minor modification of the well established Desync algorithm 
0, 01 is the gradient descent method applied to a specific 
optimization problem. Although desynchronization can also 
be viewed from a consensus perspective || 23 , the optimiza¬ 
tion approach is more powerful as it allows deriving faster 
algorithms ll2l . ll27l . Our contributions are as follows; 

• We establish novel upper bounds on the convergence rate 
of the Desync process. Such bounds can yield reliable 
estimates of worst-case energy consumption and time re¬ 
quired for convergence, which are important for systems 
that operate under delay and/or energy constraints. 

• We propose a novel desynchronization algorithm based 
on Nesterov’s accelerated gradient method ll2^ . 1291 . 
We show, both theoretically and experimentally, that the 
proposed algorithm leads to faster convergence to steady 
state than the conventional Desync algorithm 0, nni. 

• We propose a novel distributed multichannel method 
that jointly performs synchronization across channels 
and desynchronization within each channel. Contrary to 
O, the proposed algorithm leads to time-synchronous 
multichannel TDMA coordination (where nodes allocated 
the same timeslot in adjacent channels are synchronized). 
In this way, nodes can swap channels (thus, avoiding 
persistent interference in certain channels and achieving 
higher connectivity) without the network exiting the 
steady state. 

• Finally, via simulations and experiments using a real 
WSN deployment abiding by the IEEE802.15.4 stan¬ 
dard, we show that our approach leads to decentralized 
time-synchronous multichannel MAC-layer coordination 
that achieves higher network throughput compared to 
the state-of-the-art TSCH |(5l protocol, while incurring 
comparable power consumption. 

The paper continuous as follows: Section HI] presents the 
background on PCO methods, while Section |III| derives our 


upper bound for the desynchronization process and proposes 
our novel accelerated desynchronization algorithm. Section HVl 
presents our novel formulation of multichannel coordination. 
Simulations and experiments using a WSN deployment are 
given in Section |Vl while Section |VT| concludes the paper. 


II. Background on Pulse-Coupled Oscillators 


Consider a fully-connected WSN comprising n nodes, each 
acting as a pulse-coupled oscillator M- When a node does 
not interact with others, it broadcasts a fire message or 
pulse periodically. This is modeled by assigning to node i 
a phase Ofit), whose value at time t is given by 0, ifT^ 


= ^ + modi, (1) 

where G [0,1] is the phase offset of node i and mod 1 
denotes the modulo operation with respect to unity. Pig. [T] 
illustrates O graphically: the phase 9fit) of node i can be 
seen as a bead moving clockwise on a circle, whose origin 
coincides both with 0 and 1 0, na, Ea, eoi. if is 
constant, which happens when the nodes do not interact, node i 
broadcasts a fire message every T time units, when 9fit) = 1, 
and then sets its phase to zero. When the nodes interact, 
e.g., by listening to each others’ messages, they modify 
their phases (specifically, their phase offsets), according to an 
update equation that expresses the PCO dynamics ifia . One of 
the most prominent PCO algorithms for desynchronization at 
the MAC layer of WSNs is the Desync algorithm 0 , ifTOl . 
In Desync, the nodes are ordered according to their initial 
phases: 0 < 0i(O) < 02(0) < ••• < 0rt(O) < 1. Assuming 
perfect beacon transmission and reception, the order of the 
firings in Desync will remain the same 0 , IfTOl . The phase 
9i of each node i is updated based on the phases 0i_i and 0^+1 
of its phase neighbors, nodes i — 1 and i-fl, respectively. This 
is illustrated in Pig. [T] immediately after node / — 1 transmits 
a fire message, node i modifies its phase according to 


9'fiti-i) = {l-a)9fiti_i)-\-a 


9i-i{ti-i) + 0i+i(/i_i) 
2 


, ( 2 ) 


where ti-i is the time instant in which node i — I fires, i.e., 
0i_i(/i_i) = 1, and i = 1,2,... ,n, with periodic extension at 
the boundaries. The jump-phase parameter a G (0,1) controls 
the phase increment 0, Ho). 

When node i updates its phase, it has stale knowledge of the 
phase of node i -f 1, namely, it only knows the previous value 
of 0i_|_i and not the current one. This is because node i + 1 
modified its phase when node i fired, but the value of the 
new phase has not been “announced” yet noi. In Desync, 
each node: (i) updates its phase once in each firing round (we 
say that a firing round is completed when each node in the 
network has fired exactly once); (ii) does not need to know 
the total number of nodes, n, in the network; (Hi) requires 
limited listening, as only the messages from the two phase 
neighbors are required. These features make Desync quite 
popular 0, IfTOl . Por a fully-connected network, it has been 
shown that 0 converges to the state of desynchrony at time t, 
after which the interval between consecutive firings is T jn up 
to a small threshold e. Under partial connectivity or hidden 
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nodes, convergence is still achieved under a wide variety of 
topologies, but the node firings may not be equidistant It 
has been conjectured via simulations ifTOl , ll^ that Desync 
converges to desynchrony (i.e., perfect TDMA scheduling) in 


?'Desync — O 



(3) 


firing rounds. Recently, under the assumption of uniformly 
distributed initial firing phases, an operational estimate for 
the number of firing rounds for the Desync algorithm’s 
convergence was derived Il25l . However, no upper bounds are 
known for the desynchronization process. 


now see how the update rule ^ translates into the updates 
of the phase offsets. Replacing O into (E) at firing round 
(iteration) k, we obtain 




ti-i 

Y 

= (!-- 




tj-l 

iY 

U-i 


2 L T 




^i — 1 ^ 


Ak-l) 

A +1 


— + (1 - a)(j)\ + a- - - 


III. Desync as a Gradient Method 

We start by showing that, considering a fully-connected 
network, a minor modification of Desync El, Eol can be 
viewed as a gradient descent method solving an optimization 
problem. Then, we establish novel convergence properties of 
the resulting method and derive a new accelerated desynchro¬ 
nization primitive. 

Staleness of Desync: Fig. |2] shows five consecutive con¬ 
figurations of the phases of the nodes of a network with four 
nodes. The purpose is to illustrate how the phases of the nodes 
are updated in the first iteration of Desync El, iflOl and to 
highlight our minor modification. For simplicity, we omit the 
time dependence of the phases, but use a superscript to indicate 
how many times they have been updated. In Fig. |2(a)| no 
firing has yet occurred. The first update occurs when node 2 
fires, whereby node 3 updates its phase from to 0^^^ [see 
Fig. |2(b)| . According to dUl, this update requires knowing 02 
(which is equal to 1 because node 2 is firing) and 04 (which 
is known because node 4 was the first to fire). The second 
phase update occurs in Fig. |2(c)| node 1 fires, and node 2 
updates its phase from 02 °^ to 02^\ According to @, this 
update requires the value of 0i (known because node 1 is 
firing) and 0^. The current value of 0^ (actually, 0^^^) is not 
known because node 3 has not fired since it updated its phase. 
Therefore, node 2 will use ^ 3 °^ rather than This is why 
we say that Desync is stale: each update uses stale versions 
of the phases. In step (d), node 1 updates its phase and also 
uses a stale version of the phase of node 2. Finally, in step (e), 
node 4 updates its phase using a stale version of the phase of 
node 1. We assume, however, that in contrast with the other 
nodes, this update uses the value 0^^^ (in gray) and not 

Assumption 1. In Desync, node n updates its phase at 
iteration k using 0l^_i^ in place 0 / 0 ^^^. 

Via Assumption [T] all updates in Fig. |2] use the initial 
values 0^°^, 0^\ 03 °\ and 04 *^^ In practice, this assumption 
does not lead to a discernible difference in the performance 
of Desync. 

Vector notation: Suppose we are in the fc-th firing round, 
i.e., all nodes have updated their phases k—1 times. We have 
already mentioned how the firing of a node i, say at time ti, 
enables other nodes to determine the current value of 
in ([Til: = 1 — U/T. Knowing this, each node can 

determine the value of 0^^~^\t) for any time instant. We will 


Y) 


= {l-a)4>f 


Eliminating the term U-i/T, we get: 

—__L±i— jjj a strict sense, this expression is only valid 
for z = 2 ,..., n — 1 as the updates for nodes 1 and n require 
a correcting term to compensate the fact that each 0 wraps 
around 1. Therefore, the updates for all nodes are 


= (1 - + I (4"-') + - 1) (4) 

(j)Y = (1 _ , 2 < z < n - 1 

(5) 

</.« = (1 - + I + 4"-') + 1) . (6) 


Without Assumption [T] in (jSli would be replaced 

(k) 

with (j)\ . It is, however, this assumption that enables us to 
write (|4]i-(|6]l in vector form: 




1 — a 

a 

2 



0 

a 

2 


0 

0 


a 

2 


0 


0 (^- 1 ) 



L f 0 0 ... f 

where ,..., (j)^n ^) 

containing the phases of all the nodes at iteration k, and 
d := (1, 0,..., 0, —1) £ M". Equation (|3 has the format 
of the updates usually found in the discrete-time consensus 
literature ll2^ . OTI . ll^ . In particular, the matrix in (|7]) 
can be seen as the Perron matrix of a network with a ring 
topology and the vector d can be seen as an input bias ll23ll . 
This observation can be used to provide upper bounds on 
the convergence rate of (0. However, one can view O as 
an algorithm solving an optimization problem since, besides 
also providing upper bounds, this interpretation enables the 
derivation of an accelerated version of desynchronization. This 
interpretation is formalized next. 


1 — a 
G R’ 


(7) 

is a vector 


Proposition 1. Let denote the 

phases of all nodes at firing round k. If Assumption [7] holds, 
then Desync @ and © is the steepest descent method 
applied to 


minimize 

</> 


9{4>) ■= 2 11-^^ “ 


( 8 ) 


where v = l/zz, 1„ G R" is the vector of ones, e„ = 
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(a) 


(b) 


(c) 


(d) 


(e) 


Fig. 2. Updates during the first iteration of Desync in a 4-node network: (a) initial phases; no firing has occun'ed yet; (b) the first update occurs when 
node 2 fires and after nodes 4 and 3 have fired. The firing of node 2 causes node 3 to update 9^^ to 9^\ In the remaining steps, node i fires and node j 
updates its phase, where {i, j) is (1, 2) in (c), (4,1) in (d), and (3, 4) in (e). All phases are updated as a function of the initial values, i.e., although some 
of the phases have already changed, the updates use always , 9^\ or 9^\ and not the new values. 


(0, 0,..., 0,1) G R", and 

■-1 1 0 0 ... 0 

0 -11 0 ... 0 


D = 


0 

1 


G R 


nxn 


0 0-11 
0 0 0 -1 


Specifically, the updates in (|2ll can be written as 


Corollary 1. Every limit point of the sequence produced by 
the Desync algorithm © with a G (0,1) is a stationary 
point of (HI). 

(9) Proof: The proof is given in Appendix ■ 

Corollary 2. Let represent the vector of initial phases, 
and let cfi* be any solution of d^i. Suppose 0„ < < 1„. 

Then, the number of firing rounds, td, that DeSYNC (IHl-dll) 
requires in order to generate a point (p that has accuracy 

( 10 ) 

e := is upper bounded as 


Proof: Since = 0„, we have 

\/g{(p) = D^{D(j}-vln + en) = D^Dfi + d, (11) 

where d = D^en is the vector that appears in ©. Therefore, 
the steepest descent applied to dH) yields 

0(fc) =</,('=-!) 

= - fid 

^{In-fiD'^D)cf,^'^-^^ -fid, (12) 

where is the identity matrix in M". Replacing fi = a/2, 
we obtain 

= (J„ - , (13) 

The last equation is exactly ©. ■ 

We set u = i in (0 to emphasize that the goal of Desync 
is to disperse the n phases throughout [0,1]. However, any 
other value for v would lead to the same update rule, since 
the gradient of the objective function does not depend on u; 
see (HU) in the proof. This confirms the fact that Desync does 
not require the knowledge of the number of nodes, n, in the 
network mni . Notice also that D is not full rank; therefore, the 
objective of ([U is not strictly convex. Indeed, the nullspace 
of D IS ’ z LJ Consccjuditly, if cj) is 3. 

solution of (©, so is <fi + zln for any z G R. We notice 
that the interpretation of Proposition 1 is akin to the one that 
views consensus algorithms as gradient descent methods for 
minimizing where 9i is the observation of 

agent i ||26l, 1331. 

This interpretation of Desync provides for: (i) an alterna¬ 
tive way to establish the values of a for which convergence 
holds, and (ii) an upper bound on the number of the firing 
rounds until convergence. 


2a(l — a 




< 


1 


6na(l — a) 


7 9 

-n^ + 3n + 4 


(14) 



(15) 


Proof: The proof is given in Appendix 0 ■ 

Corollary 1 confirms Theorem 1 in Eol regarding the sta¬ 
bility and convergence of Desync, albeit using different tools 
and without requiring simulations to illustrate the avoidance 
of limit cycles. Corollary 2 complements the existing order- 
of-convergence estimate of © and the operational estimates 
derived by Buranapanichkit et al. 1251 by deriving an upper 
bound for the firing rounds to achieve convergence. Such an 
upper bound allows for reliable estimates of worst-case energy 
consumption and time, expressed in number of firing rounds or 
iterations, required to reach convergence. These estimates are 
important for systems that operate under delay and/or energy 
constraints. Notice that the bound in (ITST ) is a function of 
known system parameters, namely, the number of nodes n, 
the jump-phase parameter a, the tolerance parameter e, and 
the evaluation of g{-) on the initial phase vector (the latter 
can be ignored yielding a looser bound). 

The Fast-Desync algorithm based on Nesterov: A key 
advantage of viewing desynchronization as an optimization 
problem is that we can create new primitives that converge to 
desynchrony much faster. Particularly, we can use Nesterov’s 
fast gradient algorithm ll28l . Il29l (here we use the adaptation 
in 01): 


^ (16a) 
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where € K" is an auxiliary vector. Nesterov’s method is 
applicable under the same assumptions as the steepest descent, 
i.e., when g is continuously differentiable and its gradient is 
Lipschitz continuous with constant L. However, it requires 
0 < /3 < 1/L rather than 0 < /3 < ^jL. hx the expense 
of small extra memory and computation, Nesterov’s method 
takes 0(1 /y^) iterations to produce a point 4> that satisfies 
g{4>) — g{4>*) < e, where cf)* minimizes g. Recall that the 
steepest descent takes 0(l/e) to produce such a point [cf. 
mi We shall show that this improved performance in terms 
of bounds is also observed experimentally. Note that 
converge to the same point, i.e., —>■ 0 as 

k ^ oo. More importantly, Nesterov showed in ll2^ that (fThl l 
has optimal convergence rate among first-order methods, i.e., 
methods that use information about first-order derivatives only, 
possibly from all past iterations. 

We propose applying Nesterov’s algorithm (I16al i- (ll6bl i to 
solve (O. This yields a primitive that we call Fast-Desync. 
Node i = 1,..., n holds two variables (j)i and which are 
updated at iteration k as 

= (1 - a)/rf -f - dl) (17a) 

= ' + (Hb) 


where di = 1, = —1, and di = 0 for i = 2,..., n — 1. 

Note that (I17ab is identical to the Desync updates (|4li-(|6ll. 
The only detriment is that each node needs an extra memory 
register to store which is used in (I17bl i. and perform 

the extra computations in (I17bl) . Under this modification, the 
following holds: 


Corollary 3. Let a € (0,1/2] and let 0„ < < 1„ 

represent the vectors of initial phases. Let also (f* be any 
solution of (O. Then, the number of firing rounds that FaST- 
DesynC (I17al) - (ll7bl l requires to generate a point cj) that has 
accuracy e := g{4>) is upper bounded as 


rpD < - </>* 


/a e' 


< 


3nae 


n'^ + 3n + 4 


(18) 

(19) 


Proof: The proof is given in Appendix ■ 

Contrasting (ITsT l and ([T9l l we notice that Fast-Desync 
allows for significant reduction in the order-of-iterations for 
convergence compared to Desync, particularly, 0{^\fnje) 
versus Oinje), respectively. 


IV. Extension To Decentralized Multichannel 
Coordination 

We now describe our algorithm that jointly applies syn¬ 
chronization across channels and desynchronization in each 
channel. We assume that all nodes can receive all fire message 
broadcasts in their channel. We will show experimentally, 
however, that our proposal works even for densely-connected 
WSNs (when some nodes cannot be reached by others), as 
Desync still converges in such cases 12. We first describe 
our protocol. 




Fig. 3. (a) Initial random state of n = 14 nodes in C = 4 channels; 

(b) steady state of the proposed protocol with ric = 3 nodes for channels 
c = 1 and c = 2, and ric = 4 nodes in channels c = 3 and c = 4. The 
Desync nodes (in white) allow for intra-channel desynchronization, while the 
Sync nodes (in grey) provide for cross-channel synchronization. Nodes that 
belong to balanced channel and that fire synchronously can swap channels. 
The horizontal position of a node indicates the firing moment. 


A. Proposed Decentralized Multichannel MAC-layer Coordi¬ 
nation 

Let a WSN comprise n nodes that are initially randomly 
distributed in C channels [see Fig. Oa)] —for example, the 
C = 16 channels of the IEEE 802.15.4 standard 135), IMl. 
The maximum achievable throughput per node is obtained 
when the nodes are uniformly distributed across the available 
channels and a perfect TDMA scheduling is reached in each 
channel. When the total number of nodes in the network, n, is 
divisible by C our protocol will lead to ric = ^ nodes being 
present in each channel, alternatively, rtc = { , \^~\ } 

nodes will be present in each channel, as shown in Eig. [2bf 

Existing mechanisms, such as the one in Q, can take 
place during convergence to balance the number of nodes. 
Specifically, a node lying in channel c may switch to channel 
c+1 (with cyclic extension at the border), if it detects that less 
nodes are present there. Detection of the number of nodes in 
a channel is possible by integrating this information in the fire 
messages transmitted by the nodes. In Q, in order to detect 
the number of nodes in channel c+1, nodes within channel 
c proactively switched channels for short time intervals Q- 
Here, however, we follow a different approach, which is akin 
to the proposed algorithm. In particular, a single node (which 
we later call SYNC) lying in channel c is elected to listen for 
fire messages in channel c+ 1. This specific node may jump to 
the next channel if it detects that less nodes are present there. 
When a SYNC node jumps from one channel to the next, both 
channels are set to elect their SYNC nodes anew. In order 
to avoid a race condition, where nodes continuously jump 
channels, the following conditions are defined for channel 
switching: 

J ric - nc+i > 1, if c e [1, C) 

(ric ^c-t-i c — C 

where ric denotes the number of nodes present in channel 
c 

c, with n = Uc- The switching rule and conditions ensure 

C—1 

that, after a few firing periods, there will be ric G {) [§] } 
nodes in each channel c. 

When the channels have been balanced, the proposed it¬ 
erative joint synchronization-desynchronization algorithm is 
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applied. By considering that each node acts as a pulse-coupled 
oscillator with a period of T seconds, our novel algorithm 
(see Section lIV-Bl i leads to decentralized multichannel round- 
robin scheduling. The nodes in each channel are divided in 
two classes. Specifically, all but one node in each channel 
apply desynchronization so as to achieve TDMA within the 
channel (these nodes are denoted as “Desync”). Desync 
nodes operate only within their channel, firing and listening to 
messages from the other nodes in their channel. In addition, 
one “Sync” node per channel performs cross-channel syn¬ 
chronization to achieve a time-synchronous slot structure [Fig. 
Hb)]. The Sync node of each channel listens for the SYNC 
fire message in the next channel! A node can be designated as 
the Sync node in a channel based on a pre-established rule, 
e.g., the node with the smallest node ID, or the node with the 
highest battery level (all nodes can be made to report their 
node ID and battery status in their beacon messages). 

We highlight that the existence of a SYNC node in each 
channel calls for an iterative algorithm performed jointly 
across the available channels (see Section lIV-Bl i. This is 
fundamentally different from prior schemes, e.g., Q, which 
applied desynchronization in each channel independently. In 
contrast, cross-channel synchronization allows for a channel 
swapping mechanism to be applied in the converged state. 
Specifically, nodes (both of SYNC and Desync type) that 
fire synchronously in adjacent channels can swap channels and 
time-slots in pairs using a simple RQ/ACK schem^l [see Fig. 
Ob)]. Channel swapping allows for communication between 
nodes initially present in different channels without leaving the 
steady network state, thereby achieving increased connectivity. 
Conversely, in Q, when a node changes channels, conver¬ 
gence to TDMA in the channel needs to be established anew. 

According to our protocol, starting from any random state, 
the network reaches a steady state, where; (i) the same number 
of nodes is present in adjacent channels, (ii) the nodes in each 
channel have converged to a TDMA scheduling and (Hi) the 
nodes in channels with the same number of nodes have a 
parallel TDMA scheduling, where nodes allocated with the 
same time-slot order transmit synchronously [see Fig. [3b)]. 

B. Proposed Joint SynC-DeSYNC Algorithm 

We now describe the proposed joint algorithm that al¬ 
lows for synchronization of SYNC nodes across channels and 
desynchronization of Desync nodes in each channel. Let 0c,i 
(resp. (pci) denote the phase (resp. phase offset) of node i = 
1,... ,nc in channel c = 1,... ,C. Without loss of generality 
and to simplify notation, let the node i = 1 be the SYNC node 
in each channel! Desync nodes i = 2,..., ric in channel c 
are coupled with phase neighboring nodes (both Desync and 
Sync) in the same channel. Namely, any Desync node i in 
channel c updates its phase offset (j)c,i when node r — 1 in the 

^We co nsider a cyclic behavior between channels 1 and 16 of IEEE 802.15.4 
OH, on Namely, the SYNC node at channel 16 listens for the fire message 
from the SYNC node in channel 1. 

^Swap RQ/ACK packets are transmitted at another channel during a short 
interval after and before a node’s fire message transmission. 

'^As explained in Section liv-^ any node in a channel can be the SYNC 
node. This convention is only used to simplify our notation. 



Fig. 4. Example of the phase updates performed by the proposed multichan¬ 
nel MAC algorithm: In channel 1, the DESYNC (white) node 4 undergoes 
a phase update receiving coupling from nodes 1 and 3, present in the same 
channel. In channel 2, the SYNC (grey) node 2 does not receive coupling from 
the Desync node 4 that fires. In channel 3, the phase of the SYNC node 1 
is updated due to the firing of the SYNC node in channel 4. The firing of the 
latter node also triggers a phase update of the DESYNC node 2 in channel 4. 

same channel transmits a fire message, i.e., when 0c,z-i = 1- 
The Sync node in channel c, in turn, receives coupling only 
from the SYNC node in channel c+1 (channel 1 for c = C). 
Specifically, it updates its phase offset when the SYNC 
node in the next channel fires, that is, when 0c+i,i = 1- An 
illustrative example of the phase updates performed by the 
proposed algorithm is given in Fig. |4| 

Problem formulation: Inspired by the interpretation given 
in Proposition [T] we address the multichannel coordination 
problem by solving 

c 1 ^ ^ 

minimize ... ,cf>c) :=y^-||Dc^c-ln„-|-ec|L 

C—\ 

C ^ 2 

+ XI 2 , (20) 

C=1 ^ 

where (pc = {(j)c,i,<pc, 2 , ■ ■ ■ jpc.uc) S R"“ is the vector 
containing the phase offsets of all nodes of channel c, 
Dc € is the matrix of (|9]l with dimensions Uc x ric, 

Ec = (0,0,..., 1) G and Wc = (1,0, ...,0) G 1^""=. 
While the first term of h enforces desynchronization among 
the nodes of the same channel [note that each summand 
has the same format as g in (|8]i], the second term enforces 
synchronization among the first nodes of each channel. We 
remark that the second term of ( i20] l is commonly found in the 
design of optimization-based consensus algorithms iia, ca, 

ISl- 

Intuition: We show that the direct application of the gra¬ 
dient descent method to solve (i20l l leads to updates (for the 
Sync nodes) that cannot be implemented in a practical WSN. 
However, the proposed solution will be a modification of those 
updates. 

Taking into account that — One for c, the 

gradient of h with respect to <pc is given by 
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^0c ^(^ 1 5 * ■ ■ ; “1“ 

+ (2wjcl)c - Wc-l^(t>c-l - lOc+l^^c+l) , (21) 

where := (1,0,...,0,-1) G M"”. Therefore, the partial 
derivative of h with respect to (j)c,i is 


d 


-h{4>i,. .., 4>c) 


d 4*c,i 

400,2 0c,2 — 1 0c,2+1 0C—1,2 0C+1,2 5 t 1 

0c ,2 — 1 0c, 2+1 “f (^ 0)2 5 ^ 1 ; 

where (tic)i denotes the *-th component of dc- The gradient 
descent with stepsize /3 applied to (l20l l yields for node i of 


channel c: 


A(fe) 


- /3 


^ 0c,' 




. 4>c 


(fc-1)'. 


Replacing (3 with a/2, we obtain 


. 

( 22 ) 

for i = 1 , and 


0i!i^ = (1 - oi)4>% + 4>%+} - . (23) 

for i ^ The update of (l2Tl l is similar to the Desync 
algorithm phase update in (|4|i-(|6]l. However, the derived update 
for the Sync node, given in (l22l i. does not abide by the 
coupling rules mentioned in Section IIV-AI Specifically, to 
implement (l22li in a wireless transceiver, each SYNC node 
has to listen for fire messages in its own channel, as well 
as in the previous and the next channel. This is impractical 
with the half-duplex transceiver hardware in IEEE 802.15.4- 
based WSNs. This issue stems from the symmetry of the 
matrix D [cf. dTTll and (fTTll l. To alleviate this 

issue, we propose modifying directly the matrix associated 
with the iterations (l22T i and (l23T l. Our modification is based 
on the insight that there is one degree of freedom in each 
channel. Therefore, we can fix the phase of one of the nodes 
at an arbitrary value. Our approach is to modify (l22li and (l23T l 
to have the first nodes of each channel performing a simple 
consensus algorithm llJTl (while the remaining nodes perform 
a Desync algorithm). 

Multichannel Sync-Desync (MuCh-Sync-Desync): 

Eor simplicity and without loss of generality, we assume that 


= nc- The iteration we propose is 


Qi Q 2 0 
0 Qx Q 2 




1 

"^0 

_1 



0 0 

Q 2 0 


Qi Q 2 
0 •• Qi 


: n 

:= 77-1 = 7 

1 

r0i('=-i)i 


02^'=-^) 


-1 

7 

.. 

0 

_1 


= -.M 


-0 


(24) 


where 0 is the n x n zero matrix, e„ := ( 0 , 0 ..., 0 , 1 ) G M", 
Q 2 := Diag( 7 , 0,..., 0) G K”^”, 0 < 7 < 1, and Qx is the 
n X n matrix defined as 

'1 -7 0 0 0---0 O' 

P 1-20 P 0 • • • 0 0 

0, .= 0 P 1 - 20 0 • • • 0 0 


0 0 0 0 • • • 0 1 - 20 _ 

In other words, in each channel c, node i ^ 1 performs the 
update (I 23 I 1 , while node 1 performs 

= (1 - 7)0^+- 


,(fc-l) , Ak-l) 

+ 70C+1,1 


(25) 


Recall that the phase update of the SYNC node in channel c 
is performed when the SYNC node in channel c-f 1 fires, i.e., 
when 0c+i,i{tc+i,i) = 1. Adding in both sides of dZSl) 

as well as replacing 0C+1,2 — 1 — and using ([T]) leads to 

the following phase update for the SYNC node in channel c: 


0/^( 4 + 1 , 1 ) = (1 - 7)^c,i(fc+i,i) + 7 modi. (26) 

Since 0 < 0c,i(i) < 1, it is straightforward to show that, for 
0 < 7 < 1 , dThl l provides for inhibitory coupling between 
the Sync nodes in subsequent channels, thereby leading to 
synchronization of their phases. In the following proposition 
we establish that the update (l24li converges to a solution of the 
optimization problem (l20l i. In this case, however, we cannot 
obtain an explicit convergence rate. Note that the matrix M 
is not symmetric, which complicates the convergence analysis. 
Note also that, when the number of nodes per channel varies, 
the sizes of vectors 0 , e, and matrices Qx and Q 2 in (l24ll 
vary per channel c = 1,... ,C, but their format is the same. 
Moreover, the update equations, described in (l23T l and dZSl) 
remain the same. 


Proposition 2. Let 0 < 7 < 1 and 0 < 0 < 2. Then, the 
sequence produced by (EH) converges to a solution of (EOl). 

Proof: The proof is given in Appendix |B] ■ 

In MuCh-Sync-Desync— formed by (l23]) and (EH— the 
Desync and Sync nodes per channel update their phases 
only once during a firing round in the channel. Similarly 
to existing (de)synchronization algorithms, the role of the 
parameters a and 7 in the updates of (l2Tt and dZST l is to 
compensate for missed fire messages and to not allow their 
propagation throughout all nodes and channels in the network. 

Since the update of the Desync nodes in each channel fol¬ 
lows the phase update in (|4l)-(|6|, the corresponding Nesterov 
modification can be applied to speed-up desynchronization in 
each channel. This approach leads to the East-MuCh-Sync- 
Desync version of our algorithm, of which the convergence 
speed is assessed in the next section. 


V. Experimental Evaluations 
A. Simulation Results 

All simulations were performed in MATLAB, by extending 
the event-driven simulator in El- Initially, we examine the 

^Similar to other synchronization algorithms da, every time the SYNC 
node in channel c + 1 fires the SYNC node in the previous channel will 
increase its phase towards 1 according to (26). 
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Fig. 5. Average number of firing rounds for convergence to TDMA 
scheduling for Desync and the proposed Fast-Desync with the Nesterov 
modification: (a) n = 4 and (b) n = 8. 
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performance of Desync versus its fast counterpart based on 
Nesterov’s algorithm. Then, we assess the performance of 
the proposed MuCh-Sync-Desync algorithm and its fast 
version. We use two convergence thresholds, i.e., e = 10“^ 
and e = 10“^. Convergence is reported at the firing round 
where the phases 4> of the nodes minimize the objective 
function in (l8]l with accuracy g {cf)^ < e. Following existing 
desynchronization schemes m, CQi, our algorithms’ updates 
are performed on the nodes’ phases 9i, as Assumption 1 
does not need to be followed in practice. This simplifies the 
implementation, as we do not need to know the order of firings. 
All simulations were repeated 400 times and average results 
are reported. 

The results of applying desynchronization at a given channel 
using either Desync l2l, ifTOl or the proposed Fast-Desync 
algorithm are presented in Fig. |3a) and (b) for n = 4 and n = 
8 nodes, respectively. Although our analysis proves that Fast- 
Desync converges for a S (0, 0.5], convergence is actually 
achieved for a S (0,1). In fact, Fast-Desync systematically 
reduces the required number of iterations to convergence (i.e., 
irrespective of the value of the parameter a), leading to a 



DESYNC, £ = 10“* 
-FAST-DESYNC, £ = 10“* 
DESYNC, £ = 10“’ 
-FAST-DESYNC, £ = 10“’ 
-Bound FAST-DESYNC, £ = 10“* 

■ Bound FAST-DESYNC, £ = 10“’ 

■ Bound DESYNC, £ = 10“’ 
-Bound DESYNC, £ = 10“* 










n--®' 



Fig. 6. Maximum required firing rounds to convergence for DESYNC and 
Fast-Desync versus the corresponding upper bounds, n = 8. 


2.6%-28.6% speed-up with respect to Desync. Furthermore, 
the convergence speed-up increases when a strict threshold 
(e = 10“"^) is used. The improvement is more significant at 
low and medium values of a, which are typically used in 
practice to attenuate the impact of missed fire messages. 

Fig. |6] depicts the maximum number of required firing 
rounds for convergence of Desync and Fast-Desync versus 
the bounds in Corollaries 2 and 3. The difference between 
Desync and Fast-Desync is not visible now due to the 
logarithmic scale. Because of the low e value in the denomi¬ 
nator of (fTSi) the Desync upper bound appears to be loose. 
However, the Fast-Desync bound in (fT9l l offers a tighter 
characterization of the simulation-based convergence iterations 
and follows a trend very similar to the simulation results. 

We now evaluate the convergence properties of the proposed 
MuCh-Sync-Desync and its fast version. The results are 
given in Fig. |2ta) and (b) for Uc = 4 nodes per channel 
in C = 6 and C = 16 channels, respectively. Contrasting 
these results with the ones in Fig. |5] we observe that the 
proposed multichannel algorithm requires approximately only 
10-20% more firing rounds to reach convergence than the 
single-channel Desync algorithm. It is also worth noticing 
that the proposed Fast-MuCh-Sync-Desync version offers 
a notable convergence speed-up (i.e., 6.01%-42.54%) with 
respect to the simple MuCh-Sync-Desync algorithm, ir¬ 
respective of the number of channels. 


B. Experiments with TelosB Motes 

Experimental setup: We implemented the proposed 
MuCh-Sync-Desync and its Fast version as applications 
in the Contiki 2.7 operating system running on TelosB motes. 
By utilizing the Null MAC and NullRDC network stack 
options in Contiki, we control all node interactions at the MAC 
layer via our code. By utilizing the TelosB high-resolution 
timer (rtimer library), we can achieve the scheduling of 
transmission and listening events with sub-millisecond accu¬ 
racy, and set T = 100 ms. The phase-jump parameters are 
set as a = 7 = 0.6. All nodes first listen constantly until 
convergence is achieved in their channel, at which point data 
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(a) 



(b) 


Fig. 7. Average number of firing rounds for convergence to decentralized 
multichannel TDMA scheduling for the proposed MuCh-Sync-Desync 
algorithm and its fast counterpart; nc = 4 nodes per channel are considered 
with: (a) C = 6 and (b) C = 16 channels. 


transmission starts and nodes switch to sparse listening to save 
energy. Due to interference in the 2.4 GHz band of IEEE 
802.15.4 and timing uncertainties in the fire message broadcast 
and reception, we apply three practical modifications to ensure 
that, once the network reaches the steady state, it remains there 
until the entire network operation is suspended, or nodes join 
or leave the network; 


1) Each node can transmit data in-between its own fire mes¬ 
sage and the subsequent fire message from another node, 
albeit allowing for guard time of 6 ms before and after 
the anticipated beacon broadcast times; this ensures no 
collisions occur between data and fire message packets. 

2) In the steady state, each node turns its transceiver on 
solely for the 12 ms guard time corresponding to each 
beacon message. Moreover, all nodes switch to “sparse 
listening”, i.e., they listen for beacons only once every 


eight periods, unless high interference noise is detectecl^ 

3) To remain in sparse listening and avoid interrupting data 
transmission due to transient interference, all nodes are 
set to switch to full listening only if = 10 consecutive 
fire messages are missed. Our choice of TVc provides 
stable operation under interference at the cost of slower 
reaction time. 

As mentioned in Section IIV-AI once all nodes are activated, 
they are first balanced across the available channels. Note also 
that, although our time-synchronized slot structure provides 
channel swapping between synchronous nodes, this is not 
considered in the experiments. 

We select TSCH as benchmark for our comparisons, since 
it is a state-of-the-art centralized MAC protocol for densely- 
connected WSNs El, El. Our implementation follows the 
6 tisch simulator and TSCH standard El, El, ES, namely: 
channel 11 of IEEE 802.15.4 was used for advertisements, 
the RQ/ACK ratio was set to i, the slotframe comprised 101 
slots of 15 ms each, and one node was set to broadcast the 
slotframe beacon for global time synchronization. Einally, the 
WSN under TSCH is deemed as converged to the steady state 
when 5% or less of the timeslots changed within the last 10 
slotframes. 

Adhering to scenarios involving dense network topologies 
and data-intensive communications (e.g., visual sensor net¬ 
works EH), we deployed n = 64 nodes in the C = 16 
channels of IEEE 802.15.4. This leads to ric = 4 nodes per 
channel after balancing. The 64 TelosB motes were placed in 
four neighboring rooms on the same floor of an office building, 
with each room containing 16 nodes. 

Power dissipation results: We assessed the average power 
dissipation of our scheme against TSCH by placing selected 
TelosB motes in series with a high-tolerance 1-Ohm resistor 
and by utilizing a high-frequency oscilloscope to capture the 
current flow through the resistor in real time. During this 
experiment, no other devices (or interference signal genera¬ 
tors) operating in the 2.4 GHz band were present in the area. 
Average results over 5 min of operation are reported. The 
average power dissipation of MuCh-Sync-Desync without 
transmitting or receiving data payload was measured to be 1.58 
mW. The average power dissipation of a TSCH node under 
minimal payload (128 bytes per 4 s) was found to be 1.64 mW, 
which is very close to the value that has been independently 
reported by Vilajosana et al. El. Therefore, under the same 
setup, our proposal and TSCH were found to incur comparable 
power dissipation for their operation. 

Convergence speed results: We investigate the conver¬ 
gence time of MuCh-Sync-Desync, East-MuCh-Sync- 
Desync and TSCH under varying interference levels. Rapid 
convergence to the steady state is very important when the 
WSN is initiated from a suspended state, or when sudden 
changes happen in the network (e.g., nodes join or leave). We 
carried out 100 independent tests, with each room containing 
an interference generator for 25 tests. To generate interference, 

®In the converged state, each node determines the interference noise floor 
in-between transmissions by reading the CC2420 RSSI register. If high 
interference is detected, the node switches to regular listening. Thus, sparse 
listening does not affect the stability of MuCh-Sync-Desync. 
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Fig. 8. Average time required for MuCh-Sync-Desync, its FAST version, 
and TSCH to converge under various interference levels. 

TABLE I 

Average Convergence Time (in seconds) Under FIidden Nodes . 
Numbers in Parenthesis Show the Convergence time of the 
Fast (Nesterov-based) version of our proposal. 



MuCh-Sync-Desync 

TSCH 

Without Hidden Nodes 

1.1356 (0.7351) 

15.5845 

With Hidden Nodes 

1.8514 (1.2896) 

15.2957 


an RF signal generator was used to create an unmodulated car¬ 
rier in the center of each WSN channel. The carrier amplitude 
was adjusted to alter the signal-to-noise-ratio (SNR) at each 
receiver ll^ . The nodes were set to maximum transmit power 
(-1-0 dBm) in order to operate under the best SNR possible. 

Fig shows the time required for MuCh-Sync-Desync, 
Fast-MuCh-Sync-Desync and TSCH to converge under 
varying interfering signal power levels. The results corroborate 
that our proposal reduces the convergence time by an order 
of magnitude in comparison to TSCH and that the Nesterov- 
based algorithm offers 36.48%-41.07% increased convergence 
speed under a realistic setup. Moreover, the difference in 
convergence time between the proposed mechanism and TSCH 
increases with the interference level because TSCH nodes miss 
most of the RQ/ACK messages in the advertisement (control) 
channel. This result demonstrates the key advantages of our 
decentralized MAC mechanism with respect to TSCH, namely: 
(i) it is fully decentralized and (ii) it does not depend on an 
advertisement and acknowledgement scheme. 

Results under hidden nodes: We now investigate the 
robustness and convergence speed of our scheme when some 
nodes in the WSN are hidden from other nodes. We measure 
the time to achieve convergence to steady state when a random 
subset of 20 nodes in our WSN setup was programmed to 
ignore transmissions from 4 randomly chosen nodes. The 
results in Table J] show that, irrespective of the presence 
of hidden nodes, the convergence of MuCh-Sync-Desync 
and its FAST version is an order-of-magnitude faster than 
that of TSCH. When hidden nodes are present, the required 
convergence time of MuCh-Sync-Desync (resp. its Fast 
version) increases by 63.03% (resp. 75.43%), while that of 
TSCH is actually sightly decreased by 2.13%. This is to be 



Fig. 9. Total network throughput between MuCh-Sync-Desync and TSCH 
under varying signal power levels. 

expected, as TSCH nodes simply ignore RQ packets from 
hidden nodes. Conversely, due to the Desync (resp. Fast- 
Desync) process within each channel, applied by MuCh- 
Sync-Desync (resp. its Fast version), prolonged beaconing 
will take place until all hidden nodes are placed amongst non- 
hidden Desync phase neighbors. This spontaneous robustness 
of MuCh-Sync-Desync (and its Fast version) to hidden 
nodes is an interesting property that deserves further studjQ. 

Bandwidth results: We measure the total network through¬ 
put (i.e., total payload bits transmitted by all nodes per second) 
achieved with MuCh-Sync-Desync and TSCH under var¬ 
ious interference levels. Since the measurement is performed 
after the network is converged, the throughput of MuCh- 
Sync-Desync coincides with its fast version. The results 
in Fig |9] show that MuCh-Sync-Desync systematically 
achieves substantially higher network throughput (more than 
40% increase w.r.t. TSCH), irrespective of the interference 
level. Both protocols suffer a signihcant throughput loss 
of under high interference (i.e., above 10 dBm), which is, 
however, substantially more severe for TSCH. In effect, when 
interference is above 12 dBm, the bandwidth obtained with 
TSCH drops to zero because of the inability to recover 
lost slots through advertising. Conversely, even under high 
interference levels, MuCh-Sync-Desync recuperates band¬ 
width utilization due to the elasticity of Sync and Desync 
mechanisms and the high value used for N^. 

VI. Conclusion 

We have shown that Desync, which is a well established 
desynchronization method for MAC layer coordination in 
WSNs, can be viewed as a gradient method for solving 
an optimization problem. This interpretation led to a novel, 
faster desynchronization algorithm (based on Nesterov’s mod- 
ihcation of the gradient method) and resulted in the deriva¬ 
tion of upper bounds for the convergence of desynchroniza¬ 
tion. Importantly, casting the problem of time-synchronous 
desynchronization across channels as a convex optimization 

^ For instance, one can try to determine conditions that guarantee that no 
configuration of hidden nodes can lead to instability. 
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problem, led to the derivation of novel multichannel MAC 
algorithms. Our proposed MuCh-Sync-Desync algorithm 
and its fast counterpart were benchmarked against the IEEE 
802.15.4e-2012 TSCH and were shown to provide for: (i) 
an order-of-magnitude decrease in the convergence time to 
the network steady state, (ii) more than 40% increase in 
the total network throughput, and (Hi) significantly-increased 
robustness to interference and hidden nodes in the network, 
while requiring comparable power dissipation. 


(30) 

To find a worst case scenario, we maximize (l30l l with respect 
to subject to the constraints 0„ < < 1„. This is a 

non-convex problem, but the solution can be found in closed- 
form with the following observation. Since 4- (l/n)l„l^ 
is a circulant matrix and its entries are all positive, maximiz¬ 
ing (l30b subject to 0„ < < 1„ is equivalent to 


Appendix A 

Proof of Corollary Q} It is known that every limit point 
of the steepest descent method with a constant stepsize /3, i.e., 
—/3 is a stationary point of p(0) 

whenever Vp is Lipschitz continuous, i.e., there is an L > 0 
such that IIVp(y) — V 5 (a;)|| < L||y — a;|| for all x, y € R”, 
and (3 S (0, 2/L); see ll3^ Prop.1.2.3]. In problem ®, g is 
twice differentiable, and V'^g{cf)) = D, for all (p. We can 
then set L > Amax(£^^£)), where Amax(-) is the maximum 
eigenvalue of a matrix. Notice that, for D in D 

coincides with the Laplacian matrix of the ring graph, whose 
eigenvalues are given by 2 — 2cos(27rfc/n), k = 1,... ,n ll40l 
Lemma 2.4.4]. We then have 

Amax(V^( 7 (</>)) = argmax 2 — 2cos (27rfc/n) <4. (27) 

k 

Setting L = 4, and taking into account that a = 2/3, we 
obtain that Desync converges whenever a G (0,1). Notice 
that when n is even, the maximum is achieved in ( l27T i. i.e., 
Amax(V25(0)) =4. ■ 

Proof of Corollary |2} Let g : R" —>■ R be a convex, 
continuously differentiable function whose gradient is Lips¬ 
chitz continuous with constant L. It is known that the se¬ 
quence generated by the steepest descent method with constant 
stepsize /3 S (0,2/L), i.e., — /3 

satisfies ll29] Thm.2.1.14] 


- 9 { 4 >*) 

< 2igiP^°^)-giP*))\\P^°^-P*M (28) 

■ 2||0(°) - + A:/3(2 - L/3)(/7(c/>(°)) - g(^p*)) ’ 

where p* is any minimizer of g. As shown in the proof of 
Corollary [T] L = 4 in our case. Eurthermore, g{p*) = 0. 
Taking this into account in (I28I) . using a = 2/3, and after 
some manipulations, we get dl. 

To obtain (fTSi) . we note that (IT4l i holds for any solution p* 
of ([8]l. That is, 

2a(l-a)(e 

(29) 

where S* is the set of all solutions of ([8]l. We have S* = {p+ 
zln : z S R}, where p is any solution of (l8]|. Henceforth, 
we will take p = (0,1/n, 2/n,..., (n — l)/n). Then, the 
minimization problem in (|29] | is equivalent to the minimization 
of \\p + zln — p^^^W over z, which yields z* = ^l!n{p— 
p^^'^). Hence, 


To < 


min ||<^*'°^ — ^*||2 


min Wp'^^^ — p*^^ 


P+^ll{P-P^°))ln-P^^'> 


(0) 


maximize P — P 

</)(0) 

subject to On < < 1 


(31) 


Since p = (0,1/n, 2/n,..., (n — l)/n), the solution of (ISTl) 
is = (1,1,..., 1,0, 0,..., 0), where the transition from 1 
to 0 occurs at the first index m where m > n/2. Denoting 


B := max min \\p^^'^ — p *\\2 
S.t. 0„ < P^°^ < In 


we have 


B <2 


m —1 


n—1 




^ r m — l 


2 


i—m 

n—1 




i=0 


— nm{m — 1) -|- 


i{n — l)(2n — 1) 


= — Tbnm — 6m^ -I- 6m -I- 2n^ — 3n -b ll 
3n ■' 


1 

< — 

3n 


7 , 

-n -b 3n -b 4 


(32) 

(33) 

(34) 

(35) 

(36) 


The bound in (|32] | is due to replacing = 

(1,1,..., 1,0, 0,..., 0) in ® and using || (J„-bil„l^)(^- 

P^°^)\\l < {pn\\l + {l/n^)\\lnll\\;)\\P - P^°'>\\1 = 

2\\p — Erom (d to (l34l i. we developed the 

square in the first summand and used the identities 
* = w(m — l)/2 and YlllZi — l)(2n — l)/6. 

Erom (l35T l to (l3^ . we used the bound nl2 < m < (n-b l)/2- 
Using (d in ( |29] | we get (fTSl l. ■ 

Proof of Corollary Equations (I17al i- (ll7bl i are applying 

Nesterov’s method (I16al) - (ll6bl) to problem ® with a = 2/3. It 
is known that the number of iterations that (I16al i- (ll6bl i requires 
to generate a point p that has accuracy e = g{p) is bounded 
as d 


?'FD < 


|U(0) 

y/e-g{p*)'' 



(37) 


where p* minimizes g. This expression is valid for /3 S 
(0,1/T], where L is the Lipschitz constant of Vg. We saw 
in the proof of Corollary [T] that L = 4 is a valid choice. 
Since g{p*) = 0 for any optimal p*, and using a = 2/3 
in d, we get (fTsT l. To obtain (fT^ from (fTsl l. we use (l3^ 
from the proof of Corollary |2] ■ 
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Appendix B 

Proof of Proposition^ If converges, its limit will 

be a fixed point of (l24l i. Before showing that } converges, 
we show that any hxed point of (l24l i solves (l20l i. Let 0* = 
(</)*, (/)2i ■ • ■ I ‘/’c) be a hxed point of (l24li . For each c, we have 

4>*c,i = (1 - lWc,i + 7 4>*c+i,i , * = 1 (38) 

4’*c,i = P 4>*c,i-l + (1 ~ ‘^P)4’c,i + P 4‘c,i+l + P{^n)i , t ^ 1 ■ 

(39) 

From ( l38l l. and since 7 > 0, we have 4>*i= fc+i 1, for all c 
modulo C. This makes the second summation term in (l20l l 
equal to zero, that is, cf)*, for all c. 

From ( |39] |. and since /3 > 0, we have 


There is exists a permutation matrix P such that 


MP = 


0 


Cx{n-l)C 


where 


R 


R = 


1 — 7 7 0 

0 1 — 7 7 


7 0 0 • • • 1 — 7 


pCxC 


+' 


c,i+l 


i = 2, 3,..., n - 1 (40) 


1 


(41) 


These equations are equivalent to i+i — i = 1/tr, for i = 
1 ,..., n — 1, and 4>* i + ^ — (f’c n = ^nd this makes 

the hrst term of the objective of ( |20] | equal to zero. To see 
why the above equivalence holds, note that dTOl - dTlT i imposes 
that all n — 1 phases 4>c n be placed in the 

interval + !]■ Furthermore, each phase has to equal 

the average of the previous phase with the next phase, where 
the phase previous to f* 2 is </)* 1 and the phase next to 0c, n 
is 0* + 1. The only possibility is all phases, including the 
extreme points, being equispaced. 

We now prove that {0^^^} converges. Writing dH in a 
more compact form. 


+6. (42) 


It is known that the sequence {0^^^} produced by (l42T i 
converges to {I — M)~^b whenever the spectral radius of M, 
denoted as p{M), is strictly smaller than 1 1411 §1.2]. In our 
case, however, 1 is an eigenvalue of M, so p{M) > 1. 
By computing all the eigenvalues of M, we will see that 
actually p{M) = 1. Before proceeding, note that the vector 
of ones, InC, is a right eigenvector of M associated to the 
eigenvalue 1, and u := (ei, ei,..., ei) S is a left 

eigenvector of M also associated to the eigenvalue 10 To 
compute the eigenvalues of M, first decompose Qi as 


Qi 



oLi' 

T 


Such permutation matrix corresponds to a reordering of the 
nodes such that 0 is mapped onto 

(01,2,01,3, • ■ • , 01,n, 02,2, 02,3, ■ • ■ , 02,n, ■ • ■ , 

01,1,02,1, • ■ • , 0C.l) , 

that is, the first nodes of each channel are in the end of the 
vector in the new coordinate system. The matrices M and 
P^MP have the same eigenvalues. The upper triangular 
structure of P^MP reveals that its eigenvalues are the 
roots of det(r ~ A/)‘^det(i2 — XI) = 0, where I is an 
identity matrix with appropriate dimensions. In other words, 
the eigenvalues of M are the union of the eigenvalues of T, 
each with multiplicity C, with the eigenvalues of R, each with 
multiplicity 1. Since T is tridiagonal Toeplitz, its eigenvalues 
are Xj{T) = 1 — 2/3 + 2/3 cos(^j), for j = 1,..., n — 1 133 
p.514]. The matrix R, on the other hand, is a circulant 
matrix and hence its eigenvalues are the Fourier transform 
of the vector that generates the matrix. In this case, they 
are Xj{R) = 1 - 7 + 7exp(^j), for j = 
where i := \/—l. Since 0 < 7 < 1, i? has one eigenvalue 
equal to 1 (multiplicity 1) and the remaining ones have mag¬ 
nitude smaller than 1. As 0 < /3 < 1/2, all eigenvalues of T 
have magnitude smaller than 1. We conclude that p{M) = 1, 
and that its algebraic (and geometric) multiplicity is 1. 

Define M := M — Incu"^- Then, p{M) < 1 ll42l Lemma 
8.2.7], and (l42l l can be written as 

0('=( = + b. (43) 

Since u^M — u and u^b = 0, (l42l) tells us that = 

M In particular. 


where r = (/3, 0,..., 0, /3) S 

‘1-20 /3 

0 1-2/3 


T = 


R" and 

0 ••• 

/3 ••• 


0 0 
0 0 


w^0('=) = = .. • = w^0(i) = «^0(°). 

Defining h = 6 + Incu^4’^^\ (l43t can then be written as 

0('=^ = -fh, (44) 


_ 0 0 • • • /3 1 - 2/3_ 

^If the Pen'on-Frobenuis theory 1421 . 1431 were applicable, we would 
conclude that p{A<f) = 1, and that the eigenvalue 1 would have algebraic 
multiplicity 1. This would enable us to skip the computation of all eigenvalues 
of Af and jump to the next paragraph. However, the Perron-Frobenuis theory 
is not applicable, since AI, although being positive, is not iiTeducible. 


where p{M) < 1. Thus, according to BTl §1.2], the se¬ 
quence {0*^^^} produced by (l44l i. and thus by (l24l) . converges 
to (J - M)-i6 = (J - +{I - 

which is well-dehned and unique (note that I—M is invertible 
because p{M) < 1). This shows that the sequence {0^^^} 
converges. ■ 
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